VINERy: A Visual IDE for Information Extraction
Abstract
Information Extraction (IE) is the key technology enabling analytics over unstructured and semi-structured data. Not surprisingly, it is becoming a critical building block for a wide range of emerging applications. To satisfy the rising demand for information extraction in real-world applications, it is crucial to lower the barrier to entry for IE development and to enable users with a general computer science background to develop higher-quality extractors. In this demonstration, we present VINERy, an intuitive yet expressive visual IDE for information extraction. We show how it supports the full cycle of IE development without requiring a single line of code and enables a wide range of users to develop high-quality extractors with minimal effort. The extractors built visually in VINERy are automatically translated into semantically equivalent extractors in a state-of-the-art declarative language for IE. We also demonstrate how the auto-generated extractors can then be imported into a conventional Eclipse-based IDE for further enhancement. The results of our user studies indicate that VINERy is a significant step forward in facilitating extractor development for both expert and novice IE developers.
Similar resources
Data Extraction using Content-Based Handles
In this paper, we present an approach and a visual tool, called HWrap (Handle Based Wrapper), for creating web wrappers to extract data records from web pages. In our approach, we mainly rely on the visible page content to identify data regions on a web page. Our extraction algorithm is inspired by the way a human user scans the page content for specific data. In particular, we use text fea...
LABORATORY IDENTIFICATION OF DERMATOPHYTES USING PROTOPLAST HYBRIDIZATION
In this study, techniques for laboratory identification of dermatophyte fungi through protoplast hybridization were established. Firstly, auxotrophic mutants of different species of Microsporum and Trichophyton were induced and identified. Secondly, protoplasts from these mutants were isolated by digestion of their mycelium with Novozyme 234 using CaCl2 (0.4 M) as an osmotic stabilizer and g...
A Java Approach for Implementing Organizational Information System on NetBeans Integrated Development Environment
An organizational information system (OIS) has been designed and implemented using Visual Web JavaServer Faces (JSF) on the NetBeans Integrated Development Environment (IDE) 6.5 at the Instrumentation and Informatics Research Laboratory, Department of Electronics Science, Gauhati University, India. The database for the system was developed using MySQL version 5.0. The con...
Speech Shot Extraction from Broadcast News Videos
*Graduate School of Information Science, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, Aichi, 464-8601 Japan; Faculty of Economics and Information, Gifu Shotoku Gakuen University, 1-38 Nakauzura, Gifu, 500-8288 Japan; Information and Communications Headquarters, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, Aichi, 464-8601 Japan; §Japan Society for the Promotion of Science (JSPS), Japan Currentl...
Feature Extraction of Visual Evoked Potentials Using Wavelet Transform and Singular Value Decomposition
Introduction: Brain visual evoked potential (VEP) signals are commonly known to be accompanied by high levels of background noise, typically from the spontaneous background brain activity of electroencephalography (EEG) signals. Materials and Methods: A model based on a dyadic filter bank, discrete wavelet transform (DWT), and singular value decomposition (SVD) was developed to analyze the raw data...
Journal title:
- PVLDB
Volume 8 Issue
Pages -
Publication date 2015